3 <110> APPLICANT: de CROMBRUGGHE, BENOIT

4 NAKASHIMA, KAZUHISA 

5 ZHOU, XIN 

7 <120> TITLE OF INVENTION: MASTER BONE FORMATION TRANSCRIPTION FACTOR: 

8 COMPOSITIONS AND METHODS OF USE 
10 <130> FILE REFERENCE: UTXC:666 

C--> 12 <140> CURRENT APPLICATION NUMBER: US/09/734,329A

13 <141> CURRENT FILING DATE: 2000-11-30
15 <160> NUMBER OF SEQ ID NOS: 6

17 <170> SOFTWARE: PatentIn Ver. 2.1

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 2960 

21 <212> TYPE: DNA

22 <213> ORGANISM: Mus musculus

24 <220> FEATURE: 



25 <221> NAME/KEY: CDS

26 <222> LOCATION: (100)..(1383)

28 <400> SEQUENCE: 1

29 attctcccat tctccctccc tctcccttct ccctctccca ctggctcctc ggttctctcc 60 

31 atctgcctga ctccttggga cccggtcccc agctcgagg atg gcg tcc tct ctg 114

32 Met Ala Ser Ser Leu

33 1 5

35 ctt gag gaa gaa gct cac tat ggc tcc agt ccc ctg gcc atg ctg act 162

36 Leu Glu Glu Glu Ala His Tyr Gly Ser Ser Pro Leu Ala Met Leu Thr 

37 10 15 20 

39 gca gcc tgc agc aaa ttt ggc ggc tct agc cct ctg cgg gac tca aca 210

40 Ala Ala Cys Ser Lys Phe Gly Gly Ser Ser Pro Leu Arg Asp Ser Thr

41 25 30 35 

43 acc ctg ggg aaa gga ggc aca aag aag cca tac gct gac ctt tca gcc 258

44 Thr Leu Gly Lys Gly Gly Thr Lys Lys Pro Tyr Ala Asp Leu Ser Ala

45 40 45 50

47 ccc aaa acc atg ggg gac gcc tac cca gct ccc ttc tca agc acc aat 306

48 Pro Lys Thr Met Gly Asp Ala Tyr Pro Ala Pro Phe Ser Ser Thr Asn

49 55 60 65 

51 gga ctc ctc tct cct gca ggc agt cct ccg gcc cca gcc tct ggc tat 354

52 Gly Leu Leu Ser Pro Ala Gly Ser Pro Pro Ala Pro Ala Ser Gly Tyr

53 70 75 80 85

55 gca aat gac tac cca ccc ttc cct cac tca ttt cct ggg ccc acc ggt 402

56 Ala Asn Asp Tyr Pro Pro Phe Pro His Ser Phe Pro Gly Pro Thr Gly

57 90 95 100

59 gcc caa gac cct ggg ctc cta gtg cct aag ggg cac agc tcg tct gac 450

60 Ala Gln Asp Pro Gly Leu Leu Val Pro Lys Gly His Ser Ser Ser Asp

61 105 110 115

63 tgc ctg cct agt gtc tac act tcc ctg gat atg act cat ccc tat ??? 498

64 Cys Leu Pro Ser Val Tyr Thr Ser Leu Asp Met Thr His Pro Tyr Gly

65 120 125 130

67 tcg tgg tac aag ??? ??? atc ??? gca ??? atc tca cca ??? cca ??? 546

68 Ser Trp Tyr Lys Ala Gly Ile His Ala Gly Ile Ser Pro Gly Pro Gly

69 135 140 145

71 ??c aca cct act ??? tgg tgg ??? atg ??? cct ??? ??? aac tgg ??? 594

72 Asn Thr Pro Thr Pro Trp Trp Asp Met His Pro Gly Gly Asn Trp Leu

73 150 155 160 165

75 ??? ggt ggt cag ggc cag gg? ??? ggg ctg caa ggg aca ctg tcc aca 642

76 Gly Gly Gly Gln Gly Gln Gly Asp Gly Leu Gln Gly Thr Leu Ser Thr

77 170 175 180

79 ggc cct gcc cag cct cca ctg aac ccc cag ctg cct ??? ??? cca ??? 690

80 Gly Pro Ala Gln Pro Pro Leu Asn Pro Gln Leu Pro Thr Tyr Pro Ser

81 185 190 195

83 gac ttt gct ccc ??? aac cca gct ccc tac cca gcg ccc cac ctc ??? 738

84 Asp Phe Ala Pro Leu Asn Pro Ala Pro Tyr Pro Ala Pro His Leu Leu

85 200 205 210

87 caa cca ggg ccc cag ca? gtc cta ccc caa gat gtc ??? aag ccc aag 786

88 Gln Pro Gly Pro Gln His Val Leu Pro Gln Asp Val Tyr Lys Pro Lys

89 215 220 225

91 gcg gtt ggc aat ag? ggg caa ctg gag ggg ag? ggt gca gcc aaa ccc 834

92 Ala Val Gly Asn Ser Gly Gln Leu Glu Gly Ser Gly Ala Ala Lys Pro

93 230 235 240 245

95 cct cgg ggt gct ggc aca ggg ggc agc ggt gga ??? gcg ggc ag? ggg 882

96 Pro Arg Gly Ala Gly Thr Gly Gly Ser Gly Gly Tyr Ala Gly Ser Gly

97 250 255 260

99 gca ggg cgt tct acc tgc gac tgc ccc aac tgt cag gag cta gag cgg 930

100 Ala Gly Arg Ser Thr Cys Asp Cys Pro Asn Cys Gln Glu Leu Glu Arg

101 265 270 275

103 ctc ggg gca gca gcg gct ggg ctg ?gg aag aag ccc ??? cac agc tgc 978

104 Leu Gly Ala Ala Ala Ala Gly Leu Arg Lys Lys Pro Ile His Ser Cys

105 280 285 290

107 cac atc cct ggg ??? ggc ??? ??? ??? ??? ??? ??? ??? ??? ??? ??? 1026

108 His Ile Pro Gly Cys Gly Lys Val Tyr Gly Lys Ala Ser His Leu Lys

109 295 300 305

111 gcc cac ttg cgc tgg cac act ??? ??? ??? cct ttc gtc tgc aac tgg 1074

112 Ala His Leu Arg Trp His Thr Gly Glu Arg Pro Phe Val Cys Asn Trp

113 310 315 320 325

115 ctt ttc tgc ggc aag agg ttc act cgc tct gac gag ctg gag cgc cac 1122

116 Leu Phe Cys Gly Lys Arg Phe Thr Arg Ser Asp Glu Leu Glu Arg His

117 330 335 340

119 gtg cgc act cac acc cgg gag aag aag ttc act tgc ctg ctc tgt tcc 1170

120 Val Arg Thr His Thr Arg Glu Lys Lys Phe Thr Cys Leu Leu Cys Ser

121 345 350 355

123 aag cgc ttt acc aga agc gac cac ttg agc aaa cat cag cgc acc cac 1218

124 Lys Arg Phe Thr Arg Ser Asp His Leu Ser Lys His Gln Arg Thr His

125 360 365 370

127 ggg gag cca ggc ccg gga ccg ccc cca agt ggc cct aag gag ctg ggg 1266

128 Gly Glu Pro Gly Pro Gly Pro Pro Pro Ser Gly Pro Lys Glu Leu Gly

129 375 380 385

131 gag ggt cgc agc gtc ggg gaa gaa gaa gcc aat cag ccg ccc cga tct 1314

132 Glu Gly Arg Ser Val Gly Glu Glu Glu Ala Asn Gln Pro Pro Arg Ser

133 390 395 400 405

135 tcc act tcg cct gca ccc cca gaa aaa gcc cac gga ggc agc cca gag 1362

136 Ser Thr Ser Pro Ala Pro Pro Glu Lys Ala His Gly Gly Ser Pro Glu

137 410 415 420

139 cag agc aac ctg cta gag atc tgagccgggt agaggaaggt ctccagctcc 1413

140 Gln Ser Asn Leu Leu Glu Ile

141 425

143 agggtcctct tgccaggctc tcttggcgtg ctggacccat tggttgcccc tcgctctctc 1473
145 ctattgcatg ctatactctg ggggctctct ctgttcccct aggctatctc cttgcatgtc 1533
147 tcctcagttc ttctctcttt gtcaagagtc ttagccaaac tcctctcagg cctttgccag 1593
149 tgcctagttc ctatgctccg acctcctcaa ctttttcttc tctgcccctg ttcttcacag 1653
151 cttccatctg gcctcacatc attttctcat taactcgttg ccatctaatc tttctgcttc 1713
153 ccaatcctat ttgccgtttt cccgaagctt ccaggctgtc gcctcgattc ccccccacct 1773
155 ttcgtcttcc tgagctttgt gttttctttt tttaaacaaa cacgatgatg atgatgatga 1833
157 tgatgataat ttattgcccc ctggtgttct tcattaggaa ccagagttaa ggagattggt 1893
159 gttagtaacc tggccgggag cagagtgcca agaaggggga agtccaatgg ggatctgatc 1953
161 ccaaagatgg ggtgacccca gggtcaggga ggctgccccc agccttgagt acttaacccc 2013
163 tatgcgccag gagtaaagaa tagtaatagt aataataata ataattctat ttatctaagt 2073
165 tatgatgacg ggtcaggtac agtgagctgg agagggaaag ggattctccc cgcccccaag 2133
167 gaaattctag tcaaatgcat ctctgtatag acaaatgata gtggagacct tgctcgtaga 2193
169 tttctatcct cgaggtctcc gagagtttct ttttcagttg agttttgggt tgttcggcct 2253
171 cttttagagt ttctgtgggt gtctctctgt taggcagtca ctaagatccc cagccccagc 2313
173 cagaaagctg tgaaacttca agtcctatgg cggggaggac tggaatgtac cccagtcctc 2373
175 tcgacccgac tgcagatcag gttcctcccc tgatcctctt ctcataccct gtgacctcac 2433
177 caggttatcc ccttgtcgtc atggttacag agagcttgca gctgccatct taaacgtgct 2493
179 ctttggggga gagcccacct aacaggagga ttttggtttg gaggtgcccc tcctgaaaaa 2553
181 gtaggtgggc aaaggctttc tctgggatca aattcaaata aatcaagtat ttattgaatg 2613
183 cttaatatgt gcaaggcctg gtgcctagaa gccacgagaa agaatttata acaggacaga 2673
185 agtccctaaa ctaaacatcc acaggccccc aatctaggag gtttcactcc attccagtga 2733
187 cttttaaagc cgctttgtgc ctttgaaatg cctttcctga gatttttgga tcttcctgtt 2793
189 ctgtcccctg ctccttctag gcctcaagat aaagggtaaa gccatggagt ctgggaagag 2853
191 cataacgtcg ttgacgggat cgtccctttg tggaatcttt cttttttttt taatttaata 2913
193 aataaaagtt cgatttcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 2960

196 <210> SEQ ID NO: 2

197 <211> LENGTH: 428

198 <212> TYPE: PRT

199 <213> ORGANISM: Mus musculus

201 <400> SEQUENCE: 2

202 Met Ala Ser Ser Leu Leu Glu Glu Glu Ala His Tyr Gly Ser Ser Pro

203 1 5 10 15

205 Leu Ala Met Leu Thr Ala Ala Cys Ser Lys Phe Gly Gly Ser Ser Pro

206 20 25 30

208 Leu Arg Asp Ser Thr Thr Leu Gly Lys Gly Gly Thr Lys Lys Pro Tyr 

209 35 40 45 

211 Ala Asp Leu Ser Ala Pro Lys Thr Met Gly Asp Ala Tyr Pro Ala Pro

212 50 55 60

214 Phe Ser Ser Thr Asn Gly Leu Leu Ser Pro Ala Gly Ser Pro Pro Ala

215 65 70 75 80

217 Pro Ala Ser Gly Tyr Ala Asn Asp Tyr Pro Pro Phe Pro His Ser Phe

218 85 90 95

220 Pro Gly Pro Thr Gly Ala Gln Asp Pro Gly Leu Leu Val Pro Lys Gly

221 100 105 110

223 His Ser Ser Ser Asp Cys Leu Pro Ser Val Tyr Thr Ser Leu Asp Met

224 115 120 125

226 Thr His Pro Tyr Gly Ser Trp Tyr Lys Ala Gly Ile His Ala Gly Ile

227 130 135 140

229 Ser Pro Gly Pro Gly Asn Thr Pro Thr Pro Trp Trp Asp Met His Pro

230 145 150 155 160

232 Gly Gly Asn Trp Leu Gly Gly Gly Gln Gly Gln Gly Asp Gly Leu Gln

233 165 170 175

235 Gly Thr Leu Ser Thr Gly Pro Ala Gln Pro Pro Leu Asn Pro Gln Leu

236 180 185 190

238 Pro Thr Tyr Pro Ser Asp Phe Ala Pro Leu Asn Pro Ala Pro Tyr Pro

239 195 200 205

241 Ala Pro His Leu Leu Gln Pro Gly Pro Gln His Val Leu Pro Gln Asp

242 210 215 220

244 Val Tyr Lys Pro Lys Ala Val Gly Asn Ser Gly Gln Leu Glu Gly Ser

245 225 230 235 240

247 Gly Ala Ala Lys Pro Pro Arg Gly Ala Gly Thr Gly Gly Ser Gly Gly

248 245 250 255

250 Tyr Ala Gly Ser Gly Ala Gly Arg Ser Thr Cys Asp Cys Pro Asn Cys

251 260 265 270

253 Gln Glu Leu Glu Arg Leu Gly Ala Ala Ala Ala Gly Leu Arg Lys Lys

254 275 280 285

256 Pro Ile His Ser Cys His Ile Pro Gly Cys Gly Lys Val Tyr Gly Lys

257 290 295 300

259 Ala Ser His Leu Lys Ala His Leu Arg Trp His Thr Gly Glu Arg Pro

260 305 310 315 320

262 Phe Val Cys Asn Trp Leu Phe Cys Gly Lys Arg Phe Thr Arg Ser Asp

263 325 330 335

265 Glu Leu Glu Arg His Val Arg Thr His Thr Arg Glu Lys Lys Phe Thr

266 340 345 350

268 Cys Leu Leu Cys Ser Lys Arg Phe Thr Arg Ser Asp His Leu Ser Lys

269 355 360 365

271 His Gln Arg Thr His Gly Glu Pro Gly Pro Gly Pro Pro Pro Ser Gly

272 370 375 380

274 Pro Lys Glu Leu Gly Glu Gly Arg Ser Val Gly Glu Glu Glu Ala Asn

275 385 390 395 400

277 Gln Pro Pro Arg Ser Ser Thr Ser Pro Ala Pro Pro Glu Lys Ala His

278 405 410 415

280 Gly Gly Ser Pro Glu Gln Ser Asn Leu Leu Glu Ile

281 420 425

285 <210> SEQ ID NO: 3 

286 <211> LENGTH: 14

287 <212> TYPE: PRT

288 <213> ORGANISM: Mus musculus

290 <400> SEQUENCE: 3

291 Ala His Gly Gly Ser Pro Glu Gln Ser Asn Leu Leu Glu Ile

292 1 5 10

295 <210> SEQ ID NO: 4

296 <211> LENGTH: 85

297 <212> TYPE: PRT

298 <213> ORGANISM: Mus musculus

300 <400> SEQUENCE: 4

301 Ile His Ser Cys His Ile Pro Gly Cys Gly Lys Val Tyr Gly Lys Ala

302 1 5 10 15

304 Ser His Leu Lys Ala His Leu Arg Trp His Thr Gly Glu Arg Pro Phe

305 20 25 30

307 Val Cys Asn Trp Leu Phe Cys Gly Lys Arg Phe Thr Arg Ser Asp Glu

308 35 40 45

310 Leu Glu Arg His Val Arg Thr His Thr Arg Glu Lys Lys Phe Thr Cys

311 50 55 60

313 Leu Leu Cys Ser Lys Arg Phe Thr Arg Ser Asp His Leu Ser Lys His

314 65 70 75 80

316 Gln Arg Thr His Gly

317 85

320 <210> SEQ ID NO: 5

321 <211> LENGTH: 244

322 <212> TYPE: PRT

323 <213> ORGANISM: Mus musculus

325 <400> SEQUENCE: 5

326 Phe Gly Gly Ser Ser Pro Leu Arg Asp Ser Thr Thr Leu Gly Lys Gly

327 1 5 10 15

329 Gly Thr Lys Lys Pro Tyr Ala Asp Leu Ser Ala Pro Lys Thr Met Gly

330 20 25 30

332 Asp Ala Tyr Pro Ala Pro Phe Ser Ser Thr Asn Gly Leu Leu Ser Pro

333 35 40 45

335 Ala Gly Ser Pro Pro Ala Pro Ala Ser Gly Tyr Ala Asn Asp Tyr Pro

336 50 55 60

338 Pro Phe Pro His Ser Phe Pro Gly Pro Thr Gly Ala Gln Asp Pro Gly

339 65 70 75 80

341 Leu Leu Val Pro Lys Gly His Ser Ser Ser Asp Cys Leu Pro Ser Val

342 85 90 95

344 Tyr Thr Ser Leu Asp Met Thr His Pro Tyr Gly Ser Trp Tyr Lys Ala

345 100 105 110

347 Gly Ile His Ala Gly Ile Ser Pro Gly Pro Gly Asn Thr Pro Thr Pro

348 115 120 125

350 Trp Trp Asp Met His Pro Gly Gly Asn Trp Leu Gly Gly Gly Gln Gly

351 130 135 140

353 Gln Gly Asp Gly Leu Gln Gly Thr Leu Ser Thr Gly Pro Ala Gln Pro

354 145 150 155 160

356 Pro Leu Asn Pro Gln Leu Pro Thr Tyr Pro Ser Asp Phe Ala Pro Leu

357 165 170 175